CDS

Accession Number TCMCG023C23756
gbkey CDS
Protein Id PIN03478.1
Location complement(join(37248..37434,37515..37589,38373..38438,38981..39053,39186..39225,39471..39543,40295..40423,40633..41411,42125..42218,42291..42331,42426..42549,42659..43017))
Organism Handroanthus impetiginosus
locus_tag CDL12_23999

Protein

Length 679aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA324125, BioSample:SAMN05195323
db_source NKXS01005511.1
Definition hypothetical protein CDL12_23999 [Handroanthus impetiginosus]
Locus_tag CDL12_23999

EGGNOG-MAPPER Annotation

COG_category L
Description DNA binding domain with preference for A/T rich regions
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K15200        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGGAGGAAGAAGGACCAGTGGAATTTGACGAAGACGCTCCACTAGCCTTGGCACTTACGCGAAAGAGAAAAGTAATCAAGAAGAAGAAAGAACCTCATGTGATTCCCAAGAAGAATGCTCCGCCCAACAAAGCAACGCAGGCGGCTCACAATCCTGCGCCTGCCCAATTGACTGGCGACGCTGAACATCCCACAACCAACGCTTCACCGGAGATTGCAGTTTCTGAATTTGATTATTCCATCGAGAATCACTTTAAAGCCGTTGATACAATAACCAAACTTTGTGGACACCCTACAACACTCGATGCTAACCATGTGGAAATTAAACGACTCTCCAATTCTATTACTTTCTTGAGAGAGTGGAGGGATTTTAAATATGCTCCACGAACTGTGATTTTTGCTTCTCAACATAATTCTCAGAAGAAAGATGTAGTAGCTGGAGTTACCTTATCTCAATTTTCAGCTGCATCTGTTCCCAAGAAAGAGATGCAAAATGGGATTGAAGCTGTTACAGAATCTAGCAAAGACTTTGTGATGCATGTTGGGGGTTCAGTCTGGGCATTAGATTGGTGTCCTAGCGTTGACTGCAATTCTGAGAATCCCATTAAGTCCCAGTTCATTGCTGTTGCTGCTCATCCTCCTGAATCTTCATATCACAAGATTGGTGCCCCTTTAACTGGCAGAGGTGCTGTCCAGATATGGTGTTTGTTAACTGTTTCTGTTAAAGAAGATGTACTTTCTCTGGGGAATAAAAAGTCAAGACAAAATTCCCAGAACAAGCTTGCTAAACTACCGAAGCCAAGAGGGAGACCAAGAAAGAAACCGCTTAATGACTCTGTAGAGAAGATGGATACTGATAGCCAGTCTGTACAGCCTCTTGCTGTTGAATACCCTATGGGCTTGCCTGGTTCTCATTCTTCAGGTAGAATTTCCAGCAATTCAAGTAAGCATTTTTATGAGGAAGAATCTATGAGAACTCATATGGATTCTAACCAAACAGAATATGCAAATGCACCCTTGTTGTCGGCTCCTAAAGGCAGACGAAGTAAAGCAAAGGCAGGGAAAGAAGACCAGGTTCAGAATGACAGTTTGCATATTCTGAGACAATGTGAACAGGGAGAATCTGCAATTGTGAATCCAGTGGCATCTGCAAGCTTGGGTTTAGATTCCATGAGTGCCGACAAGAATATAACTTGTATTAGTTTTGGTGATGCTAATACTTCTGAGAACTTAGTTCCCATGGATGTTGCATTGCCTAGAATGATGCTATGTTTAGCTCACAATGGAAAGGTTGCATGGGATATAAAGTGGCGGCCAGTTAATGCTTGTGATCCTGAACCCATGTGTACAATGGGTTTTCTTGCTGTGTTGCTTGGAAATGGTGCTCTTGAAGTGTGGGAGGTCCCTCTTCCCCATACAGTGAAACTTGTTTATCCTGCTTGCCAGGAACGTATTGATCCTCGCTTCATCAAATTGAAACCAGTGTTCAGATGTTCAAGGTTGAAGTGCGGTGATAGGCAAAGTATACCTTTAACATTGGAGTGGTCTACGGCATCTCCTCATGATATGATTTTGGCCGGATGTCATGATGGAGTGGTTGCTTTGTGGAAGTTTTCAATTACTGATTCATTAACAGAAACTGGGCCTTTGCTTTCCTTCAGTGCAGAGACTGGTCCTATTAGAACACTGGCCTGGGCACCAATTCAAAGCGATCTTGAGAGTGCAAATGTGATTATCACTGCGGGTCAGAAAGGCTTTAAGTTTTGGGACATACGTGATCCATTCCGTCCGTTGTGGGACCATGCTATTCCTGCGACCACTTATGGGTTGGATTGGCTGCTAGATCCGAGATGTGTCTTCGGATCTCTTGATGATGGAATGTTGTGGTTGCTCAACTTAGAGAAGGCAACACAAGATATCCCAGTTGCTGGAAAATCTCTCTCTGTGGCACCAAAGCACGGATTCCACTGTTTTGACTGTTCCCCATTCTCAATATGGAGTGTTCAAGCTTCACGACTGACAGGTTCCTTGTTATAA
Protein:  
MEEEEGPVEFDEDAPLALALTRKRKVIKKKKEPHVIPKKNAPPNKATQAAHNPAPAQLTGDAEHPTTNASPEIAVSEFDYSIENHFKAVDTITKLCGHPTTLDANHVEIKRLSNSITFLREWRDFKYAPRTVIFASQHNSQKKDVVAGVTLSQFSAASVPKKEMQNGIEAVTESSKDFVMHVGGSVWALDWCPSVDCNSENPIKSQFIAVAAHPPESSYHKIGAPLTGRGAVQIWCLLTVSVKEDVLSLGNKKSRQNSQNKLAKLPKPRGRPRKKPLNDSVEKMDTDSQSVQPLAVEYPMGLPGSHSSGRISSNSSKHFYEEESMRTHMDSNQTEYANAPLLSAPKGRRSKAKAGKEDQVQNDSLHILRQCEQGESAIVNPVASASLGLDSMSADKNITCISFGDANTSENLVPMDVALPRMMLCLAHNGKVAWDIKWRPVNACDPEPMCTMGFLAVLLGNGALEVWEVPLPHTVKLVYPACQERIDPRFIKLKPVFRCSRLKCGDRQSIPLTLEWSTASPHDMILAGCHDGVVALWKFSITDSLTETGPLLSFSAETGPIRTLAWAPIQSDLESANVIITAGQKGFKFWDIRDPFRPLWDHAIPATTYGLDWLLDPRCVFGSLDDGMLWLLNLEKATQDIPVAGKSLSVAPKHGFHCFDCSPFSIWSVQASRLTGSLL